Hybrid Fuzzy C-Means Clustering Algorithm Oriented to Big Data Realms
Abstract
A hybrid variant of the Fuzzy C-Means and K-Means algorithms is proposed to solve large datasets such as those presented in Big Data. The Fuzzy C-Means algorithm is sensitive to the initial values of the membership matrix. Therefore, a special configuration of the matrix can accelerate the convergence of the algorithm. In this sense, a new approach is proposed, which we call Hybrid OK-Means Fuzzy C-Means (HOFCM), and it optimizes the values of the membership matrix parameter. This approach consists of three steps: (a) generate a set of n solutions of an x dataset, applying a variant of the K-Means algorithm; (b) select the best solution as the basis for generating the optimized membership matrix; (c) resolve the x dataset with Fuzzy C-Means. The experimental results with four real datasets and one synthetic dataset show that HOFCM reduces the time by up to 93.94% compared to the average time of the standard Fuzzy C-Means. It is highlighted that the quality of the solution was reduced by 2.51% in the worst case.
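The three steps in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes plain Lloyd-style K-Means in place of the paper's K-Means variant, the sum of squared errors as the criterion for the "best" solution, and the standard Fuzzy C-Means membership formula to turn the chosen centers into an initial membership matrix. All function names (`kmeans`, `memberships`, `fcm`, `hybrid_fcm`) and the toy dataset are hypothetical.

```python
import random

def dist2(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(X, c, rng, iters=20):
    """Plain Lloyd K-Means (assumption: stands in for the paper's variant)."""
    centers = [list(p) for p in rng.sample(X, c)]
    for _ in range(iters):
        groups = [[] for _ in range(c)]
        for p in X:
            groups[min(range(c), key=lambda k: dist2(p, centers[k]))].append(p)
        # An empty cluster keeps its previous center.
        centers = [[sum(col) / len(g) for col in zip(*g)] if g else centers[k]
                   for k, g in enumerate(groups)]
    sse = sum(min(dist2(p, ctr) for ctr in centers) for p in X)
    return centers, sse

def memberships(X, centers, m=2.0):
    """Standard FCM membership update: u_ik proportional to d_ik^(-2/(m-1))."""
    U = []
    for p in X:
        d = [max(dist2(p, ctr), 1e-12) for ctr in centers]  # squared distances
        w = [dk ** (-1.0 / (m - 1.0)) for dk in d]
        s = sum(w)
        U.append([wk / s for wk in w])
    return U

def fcm(X, U, m=2.0, tol=1e-4, max_iter=100):
    """Fuzzy C-Means started from a given membership matrix U."""
    c = len(U[0])
    for it in range(1, max_iter + 1):
        centers = []
        for k in range(c):
            w = [u[k] ** m for u in U]
            tot = sum(w)
            centers.append([sum(wi * p[j] for wi, p in zip(w, X)) / tot
                            for j in range(len(X[0]))])
        new_U = memberships(X, centers, m)
        delta = max(abs(a - b) for ru, rn in zip(U, new_U) for a, b in zip(ru, rn))
        U = new_U
        if delta < tol:
            break
    return centers, U, it

def hybrid_fcm(X, c, n=5, m=2.0, seed=0):
    rng = random.Random(seed)
    runs = [kmeans(X, c, rng) for _ in range(n)]     # (a) n K-Means solutions
    best_centers, _ = min(runs, key=lambda r: r[1])  # (b) lowest SSE (assumed criterion)
    U0 = memberships(X, best_centers, m)             # optimized initial membership matrix
    return fcm(X, U0, m)                             # (c) solve with Fuzzy C-Means

# Toy dataset: two well-separated groups of five points each.
X = [[0.1 * i, 0.0] for i in range(5)] + [[10.0 + 0.1 * i, 10.0] for i in range(5)]
centers, U, iterations = hybrid_fcm(X, c=2)
```

Because Fuzzy C-Means starts from memberships derived from an already good partition, it typically needs fewer iterations than a random start, which is the effect the abstract attributes to the optimized membership matrix.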
Similar resources
OPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM
This paper presents an efficient hybrid method, namely fuzzy particle swarm optimization (FPSO) and fuzzy c-means (FCM) algorithms, to solve the fuzzy clustering problem, especially for large sizes. When the problem becomes large, the FCM algorithm may result in uneven distribution of data, making it difficult to find an optimal solution in a reasonable amount of time. The PSO algorithm does find a go...
A Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data
The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...
A Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach
In recent years, the advancement of information gathering technologies such as GPS and GSM networks has led to huge, complex datasets such as time series and trajectories. As a result, it is essential to use appropriate methods to analyze the large raw datasets produced. Extracting useful information from large datasets has always been one of the most important challenges in different sciences,...
Hybrid Fuzzy C-Means Clustering Technique for Gene Expression Data
The challenging issue in the microarray technique is to analyze and interpret the large volume of data. This can be achieved by clustering techniques in data mining. In hard clustering techniques such as hierarchical and k-means clustering, data is divided into distinct clusters, where each data element belongs to exactly one cluster, so that the outcome of the clustering may not be correct in many ti...
Journal
Journal title: Axioms
Year: 2022
ISSN: 2075-1680
DOI: https://doi.org/10.3390/axioms11080377